Novel Unsupervised Feature Filtering of Biological Data
Similar resources
Novel Unsupervised Feature Filtering of Biological Data
MOTIVATION Many methods have been developed for selecting small informative feature subsets in large noisy data. However, unsupervised methods are scarce. Examples are using the variance of data collected for each feature, or the projection of the feature on the first principal component. We propose a novel unsupervised criterion, based on SVD-entropy, selecting a feature according to its contr...
Unsupervised feature selection under perturbations: meeting the challenges of biological data
MOTIVATION Feature selection methods aim to reduce the complexity of data and to uncover the most relevant biological variables. In reality, information in biological datasets is often incomplete as a result of untrustworthy samples and missing values. The reliability of selection methods may therefore be questioned. METHOD Information loss is incorporated into a perturbation scheme, testing ...
Unsupervised Feature Selection for Text Data
Feature selection for unsupervised tasks is particularly challenging, especially when dealing with text data. The increase in online documents and email communication creates a need for tools that can operate without the supervision of the user. In this paper we look at novel feature selection techniques that address this need. A distributional similarity measure from information theory is appl...
Robust Unsupervised Feature Selection on Networked Data
Feature selection has proven effective in preparing high-dimensional data for many data mining and machine learning tasks. Traditional feature selection algorithms are mainly based on the assumption that data instances are independent and identically distributed. However, this assumption is invalid in networked data since instances are not only associated with high-dimensional features but...
Unsupervised Feature Learning from Temporal Data
Current state-of-the-art object detection and recognition algorithms mainly use supervised training, and most benchmark datasets contain only static images. In this work we study feature learning in the context of temporally coherent video data. We focus on training convolutional features on unlabeled video data, using only the assumption that adjacent video frames contain semantically similar ...
Journal
Journal title: Bioinformatics
Year: 2006
ISSN: 1367-4803,1460-2059
DOI: 10.1093/bioinformatics/btl214